Image Description Mining and Hierarchical Clustering on Data Records Using HR-Tree

نویسندگان

  • Congle Zhang
  • Sheng Huang
  • Gui-Rong Xue
  • Yong Yu
چکیده

Since we can hardly get semantics from the low-level features of the image, it is much more difficult to analyze the image than textual information on the Web. Traditionally, textual information around the image is used to represent the high-level features of the image. We argue that such “flat” representation can not describe images well. In this paper, Hierarchical Representation (HR) and HR-Tree are proposed for image description. Salient phrases in HRTree are further to distinguish this image with others sharing the same ancestor concepts. First, we design a method to extract the salient phrases for the images in data records. Then HR-Trees are built using these phrases. Finally, new hierarchical clustering algorithm based on HR-Tree is proposed for users’ browsing conveniently. We demonstrate some HR-Trees and clustering results in experimental section.. These results illustrate the advantages of our methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Method for Duplicate Detection Using Hierarchical Clustering of Records

Accuracy and validity of data are prerequisites of appropriate operations of any software system. Always there is possibility of occurring errors in data due to human and system faults. One of these errors is existence of duplicate records in data sources. Duplicate records refer to the same real world entity. There must be one of them in a data source, but for some reasons like aggregation of ...

متن کامل

Using Combined Descriptive and Predictive Methods of Data Mining for Coronary Artery Disease Prediction: a Case Study Approach

Heart disease is one of the major causes of morbidity in the world. Currently, large proportions of healthcare data are not processed properly, thus, failing to be effectively used for decision making purposes. The risk of heart disease may be predicted via investigation of heart disease risk factors coupled with data mining knowledge. This paper presents a model developed using combined descri...

متن کامل

Enhancement Clustering of Cloud Datasets using Improved Agglomerative Technique

Enhancement Clustering of Cloud Datasets using Improved Agglomerative Technique Prof. Madhuri h Parekh Smt. J.J.Kundaliya Commerce College, Rajkot, Gujarat, India. Email: [email protected] ----------------------------------------------------------------------ABSTRACT------------------------------------------------------------Cloud computing is the latest technology that delivers computing...

متن کامل

Auto-assemblage for Suffix Tree Clustering

Due to explosive growth of extracting the information from large repository of data, to get effective results, clustering is used. Clustering makes the searching efficient for better search results. Clustering is the process of grouping of similar type content. Document Clustering; organize the documents of similar type contents into groups. Partitioned and Hierarchical clustering algorithms ar...

متن کامل

بررسی میزان تأثیر داروهای درمان ناباروری در بیماران نابارور با استفاده از الگوریتم خوشه بندی و تکنیک های داده کاوی

Background and purpose: The rate of infertility has increased throughout the world. Data mining is a new method for analyzing information from databases. Few studies are done regarding infertility and using data mining in describing and predicting different treatment methods and factors influencing these methods. This paper proposes a model for evaluating the efficacy of different drugs in trea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006